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Abstract — Power efficiency of noncolierent receivers for 
impulse-radio ultra-wideband (IR-UWB) transmission systems 
can significantly be improved, on the one hand, by employing 
multiple-symbol differential detection (MSDD), and, on the other 
hand, by providing reliability information to the subsequent 
channel decoder. In this paper, we combine these two techniques. 
Incorporating the computation of the soft information into a 
' single-tree-search sphere decoder (SD), the application of this 
, soft-output MSDD in a typical IR-UWB system imposes only a 
I moderate complexity increase at, however, improved performance 
■ over hard-output MSDD, and in particular, over conventional 
symbol-by-symbol noncoherent differential detection. 

I. Introduction 

Impulse-radio ultra-wideband (IR-UWB) is widely consid- 
ered as a promising technique for low-power low-cost short- 
range wireless communication systems. One of the main 
reasons for this is its potential to employ noncoherent, hence 
low-complexity, receivers even in dense multipath propagation 
scenarios, where channel estimation required for coherent 
detection would be overly complex due to the high multipath 
resolution and relatively large delay spread of UWB signals. 

The performance penalty between coherent and noncoherent 
detection in power efficiency, i.e., in the required signal-to- 
noise ratio to guarantee a certain bit error rate (BER), can be 
closed by replacing conventional symbol-by-symbol noncoher- 
ent detection with a joint detection of a block of symbols, i.e., 
performing multiple-symbol differential detection (MSDD) 
ID, 121 ■ In El it has been shown that the underlying tree search 
problem is efficiently solved using the sphere decoder (SD) 
algorithm. However, noncoherent (MSDD-based) IR-UWB 
receiver design has mainly considered uncoded transmission 
systems, cf., e.g., |[T], iS, H. 

In this paper, we consider coded IR-UWB transmission 
employing differentially encoded BPSK (also known as differ- 
ential transmitted reference (DTR)). To keep transmitter and 
receiver design simple, we restrict to the conventional serial 
concatenation of modulation and coding at transmitter, and 
detection and decoding at receiver side, i.e., restrain to the bit- 
interleaved coded modulation (BICM) philosophy. Employing 
coding, power efficiency can significantly be improved by 
delivering reliability information to the soft-input channel 
decoder Q. Borrowing from techniques recently introduced 
for SD-based soft output generation in multiple-input/multiple- 
output (MIMO) systems ||6l, we extend the SD-based MSDD 
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algorithm presented in 131 to incorporate also soft output 
computation. 

To this end, in Section |II] we derive log-likelihood ratios 
(LLR) for MSDD of IR-UWB based on generalized-Hkehhood 
ratio testing (GLRT), and formulate their computation as a 
tree search problem. Using the soft-output sphere decoder 
(SOSD) the LLRs can be found in a single tree search, thus 
receiver complexity is increased only moderately, especially 
in comparison to repeated-tree-search approaches, cf. ||6]. 

In Section |III1 we investigate the performance of the pro- 
posed MSDD-based soft-output IR-UWB receiver and study 
the tradeoff between performance and complexity obtained 
by adjusting the channel code, the MSDD block size, and 
techniques for SD complexity reduction, such as a stopping 
criterion Q and LLR clipping |]6|. We conclude with a 
summary in Section |IV] 

II. Impulse-Radio Ultra-Wideband Transmission 

A. System Model 

The receive signal of differentially encoded BPSK IR-UWB 
is given as (cf. Fig. [U 



+ CXD 



it) = J2 M(i - iT) + n{t) 



(1) 



where hi are the differentially encoded, interleaved (11) and 
mapped (M.) output symbols ai € A = {±1} of a channel 
encoder, such that bi = 6o 111=1 '^k, with 6o — 1, and T is the 
symbol duration. The receive pulse shape p{t) is obtained from 
the convolution of transmit pulse, receive filter, and channel 
impulse response, i.e., p{t) — h^^{t) * h^^{t) *p^^{t). The 
pulse energy is normalized to one and thus, the energy per bit 
is given by E'b = 1. n{t) is white Gaussian noise of two-sided 
power-spectral density Nq/2 filtered by h^^{t). To preclude 
inter-symbol interference, the symbol duration T is chosen 
sufficiently large, such that each pulse has decayed before the 
next pulse is received. Note that the usually applied frame 
structure used for time-hopping and code-division multiple 
access [Sl, ||9l is not explicitly taken into account, as it can be 
regarded as additional linear block coding, or removed prior 
to further receive signal processing ||3]. 

B. Soft Output Generation 

The reliability information for a single information symbol, 
which is passed as soft input to the channel decoder, is 
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Fig. 1. Block diagram of the coded IR-UWB transmission system with L-branch ACR and soft-output MSDD. 



calculated based on the observation of the receive signal in 
an {L + l)-symbols interval < t < {L + 1)T (without 
loss of generality we consider the interval starting at t = 0). 
The channel is assumed to be constant in this interval, which, 
in typical indoor UWB communication scenarios, is fulfilled 
especially for moderate L ifTOl . 

Due to the unknown statistics of the receive pulse shape 
p{t), we revert to the GLRT approach, thus, in contrast to the 
ML criterion, include an explicit optimization over all finite- 
energy pulse shapes ■p{t) of some assumed duration T\ IS], 
Il3], IE). In terms of LLRs for the i'*" symbol, i = l,...,L, this 
reads 



LLRi = log 



/ maxp(t) Prjaj = +l\r{t),p{t)} 
Vmaxp(4) Pr{a^ = -l\r{t),p{t)} 



(2) 



Defining the vector of the information symbol hypotheses 
a = [ai, ...,0,]^], applying Bayes' rule, and assuming inde- 
pendent i.i.d. information symbols, i.e., a-priori information is 
not considered as feedback from the channel decoder, we have 



LLR, = log 



Eae^^ a =-i maxp(t) f{r{t)\d,p{t)) 



(3) 



where f{r{t)\d,p{t)) is the probability density function of the 
additive noise in ([T]i. As f{r{t)\d,p{t)) > 0, maximization and 
summation could be interchanged. 

In a first step, we perform the maximization over the 
unknown pulse shape, applying similar steps as shown in |3 |. 
This is summarized, very briefly, as follows. 

We define the noise-free receive signal hypothesis 
s{t\d,p{t)) = Y.f=o^ol[k=i^kP{t - iT), including the re- 
ceive pulse shape hypothesis p{t). Since additive Gaussian 
noise is assumed, which is bandlimited by a receive filter 
with equivalent noise bandwidth i?eq, and relying on the 
equivalence of distance in signal space and energy of the 
difference signal ifTTl . we arrive at the joint probability density 
function 



f{r{t)\d,p{t))=Cf 



Jo 



{r{t)-s{t\a,p{t))y dt 



(4) 



where cr^ = NqB^^, and Cf is an irrelevant constant, which 
cancels in ([3]). Applying variational calculus and using the 
fact that (ii G {±1}, the optimizing pulse shape is found to 
be popt(i) = i:TT EtobalVk=la^r{t + iT) (cf. for a 
similar derivation). Thus, 



-2r(a)-Ei=o 



maxf(7-(t)|a,p(t)) 
Pit) 



Cf ■ e 



(5) 



where we defined 

L i — l i 

r(«)-EE n (6) 

i=i i=n k=i+i 
and, for i = 1, i, Z = 0, i—l, 

Zi,, = / r{t + iT) ■ r{t + IT) dt . (7) 

"'0 

Using the result in Q, as well as applying the max-log 
approximation, (|3]l can be approximated by 



LLR,7 — 



1 



al{L + l) 



max r(a) — max T{d) 



C. Autocorrelation-Based Detection 



ai = -l 



(8) 



Before we turn to an efficient implementation of the compu- 
tation required for ([8]l based on the SD algorithm, we note that 
Zi,i, defined in O, corresponds to the output of an L-branch 
autocorrelation receiver (ACR, shown in Fig. [U with delays 
being multiples of T and integration interval T\, set in the 
order of the expected receive pulse duration. Z;_j represents the 
phase transition from hi to hi superposed by an "information 
X noise" and "noise x noise" term. 

Clearly, restricting to a single-branch ACR, i.e., L = 1, 
corresponds to symbol-by-symbol differential detection (DD). 
In this case, it can directly be seen that the hard-quantized 
ACR output gives the DD estimate, i.e., af-* = sign 
However, using ([8]) with L = 1, also the rather intuitive result 
follows, to use the unquantized ACR output for soft-output 



DD, in particular LLR^'^ = ■^[2Z., 



D. Soft-Output Sphere Decoder (SOSD) 

For efficient implementation based on the SD algorithm, 
we reformulate the maximization problems in ([Sjl into equiv- 
alent minimization problems. Since di e {±1}- r(a) < 
J2i=ijyiZo\^i-i\ holds Va. Subtracting this upper bound 
from both objective functions in ([8]) yields 



LLRi 



ctUL + 1) 



mill A(a) — min A(a) 

aeA^ a.eA'^ 
.di — — l d^— + 1 



(9) 



Note that max(a; — y) = — min(y — x), and 



i=\ 1=0 



k=l+l 



Eventually, it can be seen that the addends of the outer sum 
in ( fTOb are always non-negative and depend solely on the 
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first i (preliminary) decisions of information symbols dk, 
k = 1, i. This allows to check the decision metric compo- 
nentwise, and thus each of the two minimization problems in 
(|9|l can be solved using the SD operating on an L-dimensional 
binary tree (see Fig. |2] and also [31 for details). 

Clearly, one of the two minima in (|9]i is the GLRT-optimal 
metric 



A 



MSDD 



A(a 



MSDD\ 



min A (a) 

deA^ 



(11) 



whereas the other one is obtained from the corresponding 
counter hypothesis, i.e., the minimum metric with the restric- 



tion ai = -a^5°°, such that 



MSDD 



Consequently, we have 
1 



mm 



A(a) 



LLR, 



\iL + l) 



MSDD 



A 



MSDD 



A 



MSDD 



(12) 



(13) 



1) Single-Tree-Search SOSD: Calculating the LLRs resorts 
to finding the minimum of an unrestricted tree search, the 
corresponding sequence, and L "next-best" minima. One could 
solve these minimization problems subsequently by rerunning 
the SD for each counterhypothesis with correspondingly re- 
stricted search space. This requires to run the SD L + 1 times 
per block of L information symbols, and hence, imposes a 
high complexity burden. 

This can be alleviated by a modified SD algorithm, as 
introduced in fV2\, and further refined in Q, for soft-output 
signal detection in MIMO systems, which ensures that every 
node in the search tree is visited at most once. We incorporate 
these MIMO-SD modifications into the SD for MSDD of IR- 
UWB as described in |3J (for a detailed description of the 
MIMO-SOSD, cf. ID). Thus, the required A^^dd^ A^^dd^ 
and a^^'-"-' result from a single tree search process. 

First, the SD search radius R is not updated, whenever 
a new (preliminary) best sequence has been found, but the 
searc h radiu s update is based on the current values A'^^'-"-' 
and A^^'-"-', i = 1, L. It is set such, that only branches in 
the search tree are considered, which can lead to an update 
of either A^^dd^ ^j. /^msdd^ ■ ^ j^^^ achieved by 

setting 



i? = max<! max A^^dd^ 

/c— z,...,L 



max 
i=i,...,i-i 

with ai^af^°° 



^MSDD^ 



(14) 



Further, in the case a sequence d with path metric A(a) is 
investigated, i.e., a leaf in the search tree has been reached, 
two cases are distinguished: 

(i) If A(a) < A'^^'-"-', a new (preliminary) best sequence 
has been found. Then all Af^, i = 1,...,L, where 
di = -a^^SDD^ ^j.g ^MSDD^ followed by the usual SD 

update of the current best sequence a'^^'-"-' d and metric 

^MSDD _ 

(ii) If A(a) > A'^^'-"-', only the counterhypotheses have to be 
check ed, i.e ., all Af^, i = where di = -a^^^DD 
and Af SDD > ^~y 



2) SOSD Complexity Reduction: A reasonable measure for 
the search complexity of the SD is the number of visited nodes 
CsD in the search tree during the tree search process, which 
is directly related to hardware implementation complexity, cf., 
e.g., IS). In this paper, we adopt this complexity measure. 

In Q we introduced a packing-radius-based stopping cri- 
terion for the SD in hard-output MSDD of IR-UWB, which 
reduces the average search complexity, yet ensures optimality 
of the estimated sequence. We also apply this stopping crite- 
rion for the SOSD. If any preliminary sequence d with path 
metric A(a) fulfills 



A (a) < i?5top = L ■ min \Zi^i\ 

i—l,...,L 
l=0,...,i-l 



(15) 



the search process is terminated early. In Q it has been shown, 
that this setting guarantees to find the GLRT-optimal sequence 
Q^MSDD ^jjj ^MSDD jjowever, th is doe s not ensure to find the 
optimal solution to (fT2l i. i.e., A^^'^'-', and hence the correct 
max-log-approximated LLRs. However, as will be shown later, 
this stopping criterion enables a reduction in the average search 
complexity at only minor performance degradation. 

As shown in ||6l|, a crucial part for complexity reduction of 
the SOSD in MIMO detection, is to Umit the maximum LLR 
values during the SD search process. For MIMO detection this 
LLR clipping enables a tradeoff between the power efficiency 
of optimal soft-output detection and the complexity of hard- 
output detection. Since we aim for a similar tradeoff, we 
apply this LLR clipping with maximum LLR value LLR^ax 
during the search process, too. Thus, after each update of the 
counterhypotheses metrics A^^'-"-', those are limited to 



A 



MSDD 



max |a,^SDd^ ^^msdd + ^2 + 1)LLR„ 



(16) 

From (O it can be seen that this LLR clipping limits the 
search radius to i? < A^^°° + al{L + l)LLR„^^ and, together 
with ( fT3l l. ensures that |LLRi| < LLRmax after the SD search. 
Clearly, with LLR^ax = hard-output SD-based MSDD 
(HOSD) results. 

3) SOSD Algorithm: The resulting algorithm, based on 
the SD for MSDD of IR-UWB 13], is given in pseudo-code 
representation in Fig. |2] The modifications for soft output 
generation, LLR clipping (cf. line 19), and the stopping 
criterion (cf. line 15), are highlighted in gray shading. Note 
that this pseudo-code representation is based on the pseudo- 
code representation of the SD for MSDD of DPSK given in 
IIT3I . We introduced the counter n;, which is used to check 
if the two branches emanating from each node have been 
checked, the branch metric 

\Zu\ I l-sigii(Z,,,) f[ flfc J J (17) 
, k=i+i ) ) 

and the path me tric A , at a node in depth i. We used the short- 
hand writing AfSDD ^ ^^^^ ^ ^2(^ ^ l)LLRn,ax, 
and omitted the tilde for a hypothesis. The sub-functions 
f indbest and f indnext have been introduced (fl and f2), 
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[A 



MSDD fA 1 „MSDD1 _ 

, [-lH\i=l..L,0, J — 

stop 1 ^^n 

2: 
3: 
4: 
5: 
6: 
7: 



9: 
10 
11: 
12: 

13 

14: 

15: 
16 
17: 
18 
19 

20 

21: 
22: 
23: 
24: 
25: 
26: 
27: 

28: 



^MSDD _ _ _ 0; i := 1 

[Al];=i..L — +00 

[ai,(5, Tii] := f indbest(i, []) 
while i / { 
Ai — (5 + A,_i 
if A, < J? { 
if i 7^ i I 

i I + 1 // move down 

[ai,S,ni] := f indbest(i, a) 
} else { 

if A, < A"s°° { 

A. := A^S'=°, Vi with a. 7^ ar°° 

// store best point so far ... 
// and update best radius 



^MSDD ._ 



if A'^^°° < R,top { break and return } 
I ^j^^ I // stopping criterion, cf. (Tsj 

A, := min{A,, A;}, Vi with / aT^°° 

} 

A. - max{A„A"=°° + A^ax}, Vi 

// LLR clipping, cf. Gill 
i :— i — 1 II move up again 

[i,ai,5,ni\ := f indnext(j, a, n^) 

} else { 

i ~ i — 1 II move up 

[i,ai,(5, rii] := f indnext(i, a, n^) 

i? := max{maxfc=i,,.,z, Afc, max,^;^^ ^j^i^ „j_^„msddA,} 
I // update search radius, cf. jl4b 



function [ai,5,ni\ = f indbest(i, a) 



29: [fl-1:] a, := 



30: [fl-2:] 5 ~ Ello l^'-l U ^ sign(2-i.,) ni- 
31: [fl-3:] m := 1 



■i+i 



a.k 



II cf. 



function \i, aj, S,ni] = f indnext(i, a, n^) 

32: [f2-l:] while rii == 2 | i := i — 1 | // check constellation 
size 

33: [f2-2:] a; := — // cycle through constellation 
34: [f2-3:] S := Ejlo l^'.-l (l " sign(^i,») HLi+i Ofc) cf. 

35: [f2-4:] := + 1 

Fig. 2. Pseudo-code representation of the SOSD algorithm for soft-output 
MSDD of IR-UWB. Gray-shaded: modifications for soft output generation. 

to account for branch enumeration according to the Schnorr- 
Euchner search strategy |[T4l . 

III. Performance and Complexity 

We evaluate the performance and complexity of the pro- 
posed coded IR-UWB transmission system in a typical UWB 
scenario, where we assume no inter-symbol interference {T 
chosen sufficiently large), and p^^{t) is a Gaussian monocycle 
with 2.25 GHz center frequency and 10 dB bandwidth of 




101og(£;b/iVo) [dB 



Fig. 3. BER vs. Ei,/No in dB of coded IR-UWB with soft-output MSDD 
using the SOSD without (red) and with stopping criterion 1151 (dashed green), 
and hai'd-output MSDD (blue) for different L, in comparison to soft and hard 
DD (L = 1, black, dashed black), coherent detection (left most, black), and 
uncoded transmission (coh. and DD: dash-dotted gray). Rate-1/2 conv. code 
with i/ = 6, LLRmax = 10, IEEE-CM 2. 




□ . BDE1C3] 



10 



11.5 



12 



12.5 



13 



13.5 



i?r/A^o [dB] 



Fig. 4. Tradeoff performance vs. average SD complexity at BER = 10 ^ 
for LLR clipping levels LLR^ax = [0.05 0.1 0.25 0.5 1 2 10] (right to 
left) of MSDD of IR-UWB using the SOSD (red), the SOSD with stopping 
criterion (15) (green), and hard-output MSDD (blue) with different MSDD 
block sizes L. Rate-1/2 conv. code with u = 6, IEEE-CM 2. 

3.3 GHz. The propagation channel is modeled according to 
IEEE-CM 2 IfTOl with each realization normalized to unit 
energy. The receive filter is matched to the transmit pulse shape 
and a good compromise for the integration time is T; = 30 ns. 
We use maximum-free-distance rate-1/2 convolutional codes 
with 2" states, an interleaver size of 1000 information bits, 
and the Viterbi algorithm as (soft-input) channel decoder ifTTI . 

Fig. [3] shows the BER performance of the coded IR-UWB 
system employing the proposed SOSD-based MSDD. With in- 
creasing MSDD block size L performance improves compared 
to DD and approaches (hard-decision) coherent detection. 
Soft-output MSDD (using SOSD) achieves an additional gain 
of up to 1 dB over hard-output MSDD (LLR^ax = 0). 

Adjusting the LLR clipping level LLRmax in SOSD, 
a tradeoff between the power efficiency of GLRT-optimal 
max-log-approximated soft-output MSDD and hard-output 
MSDD is achieved, which almost continuously traverses the 
performance-complexity plane (cf. Fig.|4]i. Additionally setting 
the stopping criterion (cf. ( fTSl l and line 15 in Fig.|2]i to enable 
early termination of the SOSD search process, especially for 
small to moderate L reduces the average SD complexity at 
only minor loss in performance. 

However, using the (soft-input) Viterbi algorithm for chan- 
nel decoding imposes an increase in overall receiver com- 
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plexity depending mainly on the number of states 2^ of the 
convolutional code till . Hence, the question for the optimum 
tradeoff between power efficiency and overall receiver com- 
plexity, obtained from adjusting the major parameters code 
constraint length, MSDD block size, and LLR clipping level, 
arises. 

Since the proposed SOSD requires no multiplications (note 
that hi G {±1}), we assume that soft-input Viterbi decoding 
and the SOSD for MSDD are comparable with respect to 
complexity in the number of considered nodes in the trel- 
lis, respectively the binary search tree. Hence, the overall 
complexity per processed symbol is Co = 2"^ + Csd/L, 
where the first summand is the (fixed) complexity of the 
Viterbi algorithm and the second summand represents the 
(varying) SD complexity. As the SD complexity depends on 
the ACR output, we denote the average and maximum overall 
complexity as C^"'' and C^^'^, respectively. The worst-case or 
maximum SD complexity is C^^'' = J2f=i 2* = 2^+^ - 2. 

We compare the proposed soft-output MSDD system with 
a reference system employing a 2"^'=' -states convolutional code 
and symbol-by-symbol ACR-based DD (MSDD with L = 1) 
with overall complexity C^^^ = 2"^™' + 1. For soft-output 
MSDD, at each block size the best setting ly, LLRmax is 
chosen (i.e., lowest E-^i/Nq for a desired BER), that still 
has less overall complexity than the reference system (i.e., 
^' < C^"')- For z^ref = 7 and BER = lO^^ this results in 
the trajectories shown in Fig.|5](chosen setting indicated in top 
part, similar results are obtained for other references C'^^). The 
bottom part depicts the average (solid) and maximum (dashed) 
overall complexity using the SOSD (red), and the reference 
C'^^ (dashed black). The top part shows the corresponding 
required E]^/No in dB. For comparison hard-output MSDD is 
included (blue). 

The proposed soft-output MSDD in combination with con- 
volutional codes with 2"^ < 2"'"' states can — up to MSDD 
block sizes L < 16 — ^beat the reference system, i.e., DD and 
Vref = 7. The lowest E\)/Nq, with approximately 4 dB gain 
over the reference, is achieved for L = 12. However, for 
L > 15 the complexity of the reference system is only under- 
cut by falling back to hard-decision decoding (LLRmax = 0). 
With respect to the maximum overall complexity, soft-output 
MSDD is only better than the reference system up to L = 8. 

As for each MSDD block size the required E'b/A^o of hard- 
output MSDD is higher than that of soft-output MSDD, at 
lower average, but equal maximum complexity, hard-output 
MSDD does not utilize the (anyway to be reserved) maximum 
overall complexity as good as soft-output MSDD. 

IV. Conclusions 

We have presented a noncoherent SD-based soft-output 
MSDD receiver for coded IR-UWB transmission systems. 
Based on the GLRT approach, we have derived the LLRs and 
formulated their computation as tree search problems, enabling 
the application of the SD for efficient implementation. Incor- 
porating recent results from MIMO detection, we are able to 
compute the LLRs in a single SD tree search. In combination 




60 1 — ' ' . ' ' ' . 

1 2 4 6 8 10 12 14 16 

MSDD block size L 



Fig. 5. Trajectories of performance and overall receiver complexity of the 
soft-output and hard-output MSDD IR-UWB receiver in comparison to a 
DD reference with i^,^, = 7 (C^'' = 2^ + 1) at BER = 10~^. Lables 
(i/, LLRmax) indicate chosen setting for soft-output MSDD (see text). Rate- 
1/2 conv. codes with 2" states, IEEE-CM 2. 

with further techniques for SD complexity reduction, the 
proposed soft-output SD for MSDD of IR-UWB thus imposes 
only a moderate complexity increase compared to hard-output 
MSDD. Employing this soft-output demodulator for IR-UWB, 
a large part of the gap between conventional noncoherent DD 
and ideal coherent detection can be closed. 
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